Automatic Detection of the Prosodic Structures of Speech Utterances
Identifieur interne : 001634 ( Main/Exploration ); précédent : 001633; suivant : 001635Automatic Detection of the Prosodic Structures of Speech Utterances
Auteurs : Katarina Bartkova [France] ; Denis Jouvet [France]Source :
- Lecture Notes in Computer Science [ 0302-9743 ]
Abstract
Abstract: This paper presents an automatic approach for the detection of the prosodic structures of speech utterances. The algorithm relies on a hierarchical representation of the prosodic organization of the speech utterances. The approach is applied on a corpus of radio French broadcast news and also on radio and TV shows which are more spontaneous speech data. The algorithm detects prosodic boundaries whether they are followed or not by pause. The detection of the prosodic boundaries and of the prosodic structures is based on an approach that integrates little linguistic knowledge and mainly uses the amplitude of the F0 slopes and the inversion of the slopes as described in [1], as well as phone durations. The automatic prosodic segmentation results are then compared to a manual prosodic segmentation made by an expert phonetician. Finally, the results obtained by this automatic approach provide an insight into the most frequently used prosodic structures in the broadcasting speech style as well as in a more spontaneous speech style.
Url:
DOI: 10.1007/978-3-319-01931-4_1
Affiliations:
Links toward previous steps (curation, corpus...)
- to stream Istex, to step Corpus: 003270
- to stream Istex, to step Curation: 003229
- to stream Istex, to step Checkpoint: 000260
- to stream Main, to step Merge: 001646
- to stream Main, to step Curation: 001634
Le document en format XML
<record><TEI wicri:istexFullTextTei="biblStruct"><teiHeader><fileDesc><titleStmt><title xml:lang="en">Automatic Detection of the Prosodic Structures of Speech Utterances</title>
<author><name sortKey="Bartkova, Katarina" sort="Bartkova, Katarina" uniqKey="Bartkova K" first="Katarina" last="Bartkova">Katarina Bartkova</name>
</author>
<author><name sortKey="Jouvet, Denis" sort="Jouvet, Denis" uniqKey="Jouvet D" first="Denis" last="Jouvet">Denis Jouvet</name>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:D4C24465BE23C39366C4630171E0613FEEEE7216</idno>
<date when="2013" year="2013">2013</date>
<idno type="doi">10.1007/978-3-319-01931-4_1</idno>
<idno type="url">https://api.istex.fr/ark:/67375/HCB-8P0RP7B1-G/fulltext.pdf</idno>
<idno type="wicri:Area/Istex/Corpus">003270</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Corpus" wicri:corpus="ISTEX">003270</idno>
<idno type="wicri:Area/Istex/Curation">003229</idno>
<idno type="wicri:Area/Istex/Checkpoint">000260</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Checkpoint">000260</idno>
<idno type="wicri:doubleKey">0302-9743:2013:Bartkova K:automatic:detection:of</idno>
<idno type="wicri:Area/Main/Merge">001646</idno>
<idno type="wicri:Area/Main/Curation">001634</idno>
<idno type="wicri:Area/Main/Exploration">001634</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title level="a" type="main" xml:lang="en">Automatic Detection of the Prosodic Structures of Speech Utterances</title>
<author><name sortKey="Bartkova, Katarina" sort="Bartkova, Katarina" uniqKey="Bartkova K" first="Katarina" last="Bartkova">Katarina Bartkova</name>
<affiliation wicri:level="3"><country xml:lang="fr">France</country>
<wicri:regionArea>ATILF - Analyse et Traitement Informatique de la Langue Franaise, 44 Av De La Libration, BP 30687, 54063, Nancy Cedex</wicri:regionArea>
<placeName><region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
<settlement type="city">Nancy</settlement>
</placeName>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">France</country>
</affiliation>
</author>
<author><name sortKey="Jouvet, Denis" sort="Jouvet, Denis" uniqKey="Jouvet D" first="Denis" last="Jouvet">Denis Jouvet</name>
<affiliation wicri:level="3"><country xml:lang="fr">France</country>
<wicri:regionArea>Speech Group, LORIA Inria, F-54600, Villers-lès-Nancy</wicri:regionArea>
<placeName><region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
<settlement type="city">Villers-lès-Nancy</settlement>
</placeName>
</affiliation>
<affiliation wicri:level="4"><country xml:lang="fr">France</country>
<wicri:regionArea>Université de Lorraine, LORIA, UMR 7503, F-54600, Villers-lès-Nancy</wicri:regionArea>
<placeName><region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
<settlement type="city">Villers-lès-Nancy</settlement>
</placeName>
<orgName type="university">Université de Lorraine</orgName>
</affiliation>
<affiliation wicri:level="3"><country xml:lang="fr">France</country>
<wicri:regionArea>CNRS, LORIA, UMR 7503, F-54600, Villers-lès-Nancy</wicri:regionArea>
<placeName><region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
<settlement type="city">Villers-lès-Nancy</settlement>
</placeName>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series><title level="s" type="main" xml:lang="en">Lecture Notes in Computer Science</title>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="ISSN">0302-9743</idno>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt><idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass></textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">Abstract: This paper presents an automatic approach for the detection of the prosodic structures of speech utterances. The algorithm relies on a hierarchical representation of the prosodic organization of the speech utterances. The approach is applied on a corpus of radio French broadcast news and also on radio and TV shows which are more spontaneous speech data. The algorithm detects prosodic boundaries whether they are followed or not by pause. The detection of the prosodic boundaries and of the prosodic structures is based on an approach that integrates little linguistic knowledge and mainly uses the amplitude of the F0 slopes and the inversion of the slopes as described in [1], as well as phone durations. The automatic prosodic segmentation results are then compared to a manual prosodic segmentation made by an expert phonetician. Finally, the results obtained by this automatic approach provide an insight into the most frequently used prosodic structures in the broadcasting speech style as well as in a more spontaneous speech style.</div>
</front>
</TEI>
<affiliations><list><country><li>France</li>
</country>
<region><li>Grand Est</li>
<li>Lorraine (région)</li>
</region>
<settlement><li>Nancy</li>
<li>Villers-lès-Nancy</li>
</settlement>
<orgName><li>Université de Lorraine</li>
</orgName>
</list>
<tree><country name="France"><region name="Grand Est"><name sortKey="Bartkova, Katarina" sort="Bartkova, Katarina" uniqKey="Bartkova K" first="Katarina" last="Bartkova">Katarina Bartkova</name>
</region>
<name sortKey="Bartkova, Katarina" sort="Bartkova, Katarina" uniqKey="Bartkova K" first="Katarina" last="Bartkova">Katarina Bartkova</name>
<name sortKey="Jouvet, Denis" sort="Jouvet, Denis" uniqKey="Jouvet D" first="Denis" last="Jouvet">Denis Jouvet</name>
<name sortKey="Jouvet, Denis" sort="Jouvet, Denis" uniqKey="Jouvet D" first="Denis" last="Jouvet">Denis Jouvet</name>
<name sortKey="Jouvet, Denis" sort="Jouvet, Denis" uniqKey="Jouvet D" first="Denis" last="Jouvet">Denis Jouvet</name>
</country>
</tree>
</affiliations>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Wicri/Lorraine/explor/InforLorV4/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001634 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 001634 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Wicri/Lorraine |area= InforLorV4 |flux= Main |étape= Exploration |type= RBID |clé= ISTEX:D4C24465BE23C39366C4630171E0613FEEEE7216 |texte= Automatic Detection of the Prosodic Structures of Speech Utterances }}
This area was generated with Dilib version V0.6.33. |